Skip to content

Update README.md #61

Merged
merged 20 commits into from
Jan 13, 2019
Merged

Update README.md #61

merged 20 commits into from
Jan 13, 2019

Conversation

renewiegandt
Copy link
Collaborator

No description provided.

@JannikHamp
Copy link
Collaborator

Default von max bei motif_filtering bei mir ist 200, nicht 100. Hatte sich geändert.

@JannikHamp
Copy link
Collaborator

In the last code section, when you mention the rare moods bug, maybe add, that the pipeline (in step 1.2) checks if those files are empty and returns an error message if so. Or is this clear?

README.md Outdated
this can be done with following commands:
```
export PATH=[meme-suite instalation path]/libexec/meme-[meme-suite version]:$PATH
export PATH=[meme-suite instalation path]/bin:$PATH
```

Every other dependency will be automatically installed by Nextflow using conda. For that a new conda enviroment will be created, which can be found in the from Nextflow created work directory after the first pipeline run.
It is **not** required to create and activate the enviroment from the yaml-file beforehand.
3. Every other dependency will be automatically installed using conda. For that a conda enviroment has to be created from the yaml-file given in this repository.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Two whitespaces after 'automatically'.

README.md Outdated
Filter unknown motifs:
--min_size_fp INT Minimum sequence length threshold. Smaller sequences are discarded. (Default: 10)
--max_size_fp INT Maximum sequence length threshold. Discards all sequences longer than this value. (Default: 100)
--tfbsscan_method [moods|fimo] Method used by tfbsscan. (Default: moods)
Clustering:
Sequence preparation/ reduction:
--kmer INT Kmer length (Default: 10)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

*K-mer length

README.md Outdated
Clustering:
--global INT Global (=1) or local (=0) alignment. (Default: 0)
--identity FLOAT Identity threshold. (Default: 0.8)
--sequence_coverage INT Minimum aligned nucleotides on both sequences. (Default: 8)
--memory INT Memory limit in MB. 0 for unlimited. (Default: 800)
--throw_away_seq INT Remove all sequences equal or below this length before clustering. (Default: 9)
--strand INT Align +/+ & +/- (= 1). Or align only +/+ (= 0). (Default: 0)
Motif estimation:
--min_seq INT Sets the minimum number of sequences required for the FASTA-files given to GLAM2. (Default: 100)
--motif_min_key INT Minimum number of key positions (aligned columns) in the alignment done by GLAM2. (Default: 8)
--motif_max_key INT Maximum number of key positions (aligned columns) in the alignment done by GLAM2.f (Default: 20)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

GLAM2.f?

README.md Outdated
Clustering:
--global INT Global (=1) or local (=0) alignment. (Default: 0)
--identity FLOAT Identity threshold. (Default: 0.8)
--sequence_coverage INT Minimum aligned nucleotides on both sequences. (Default: 8)
--memory INT Memory limit in MB. 0 for unlimited. (Default: 800)
--throw_away_seq INT Remove all sequences equal or below this length before clustering. (Default: 9)
--strand INT Align +/+ & +/- (= 1). Or align only +/+ (= 0). (Default: 0)
Motif estimation:
--min_seq INT Sets the minimum number of sequences required for the FASTA-files given to GLAM2. (Default: 100)
--motif_min_key INT Minimum number of key positions (aligned columns) in the alignment done by GLAM2. (Default: 8)
--motif_max_key INT Maximum number of key positions (aligned columns) in the alignment done by GLAM2.f (Default: 20)
--iteration INT Number of iterations done by glam2. More Iterations: better results, higher runtime. (Default: 10000)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Here is glam2 all lowercase before all uppercase.

README.md Outdated
When the environment is created, set the variable 'path_env' in the configuration file as the path to it.

For unknown reasons Moods, whisch is called by tfbsscan, rarely returns empty bedfiles. If this happens The pipeline stops.
The is no known fix sofar. If it happens try starting the pipeline again. If it still fails. Change the parameter tfbsscan_method to 'fimo'. This methods takes longer but will cause no error.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

*There is
*so far

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As a workaround either restart the pipeline or change the parameter tfbsscan_method to fimo. (Instead of the third and second to last sentences).

@renewiegandt
Copy link
Collaborator Author

In the last code section, when you mention the rare moods bug, maybe add, that the pipeline (in step 1.2) checks if those files are empty and returns an error message if so. Or is this clear?

Can you give me the error message? then iI will add it

@renewiegandt
Copy link
Collaborator Author

The Parameter list is still not p to date. Please do not merge before it is.
I am going to update the list later this day.

@JannikHamp
Copy link
Collaborator

if [ $all_empty == true ]
then
echo ERROR
echo All motiffiles have less than 2 lines!
echo Fix motiffiles and try again.
exit 1
fi

Copy link
Collaborator

@SebastianBeyvers SebastianBeyvers left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We should reformat the Parameters page to one format.
And its called enviroNment with n :D

README.md Outdated
3. Every other dependency will be automatically installed using conda. For that a conda enviroment has to be created from the yaml-file given in this repository.
It is required to create and activate the enviroment from the yaml-file beforehand.
This can be done with following commands:
```condsole
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

console

README.md Outdated

**Important Note:** For conda the channel bioconda needs to be set as highest priority! This is required due to two different packages with the same name in different channels. For the pipeline the package jellyfish from the channel bioconda is needed and **NOT** the jellyfish package from the channel conda-forge!



## Quick Start
```console
nextflow run pipeline.nf --bigwig [BigWig-file] --bed [BED-file] --genome_fasta [FASTA-file] --motif_db [MEME-file] --config [UROPA-config-file]
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

organism ?!

README.md Outdated
Optional arguments:

--help [0|1] 1 to show this help message. (Default: 0)
--tfbs_path Path to directory with output from tfbsscan. If given tfbsscan will not be run.
--create_known_tfbs_path Path to directory where output from tfbsscan (known motifs) are stored.
Path can be set as tfbs_path in next run. (Default: './')
--gtf_path Path to gtf-file. If path is set the process which creats a gtf-file is skipped.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Arguments should be in one Format: With Datatype e.g. INT or without, not mixed within the document.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If its a list of strings which are allowed I prefer to write the list instead of 'STRING'. I could add 'STRING' to every path variable but I don't think that is necessary.

README.md Outdated
--motif_max_key INT Maximum number of key positions (aligned columns) in the alignment done by GLAM2.f (Default: 20)
--iteration INT Number of iterations done by glam2. More Iterations: better results, higher runtime. (Default: 10000)
--motif_max_key INT Maximum number of key positions (aligned columns) in the alignment done by GLAM2. (Default: 20)
--iteration INT Number of iterations done by GLAM2. More Iterations: better results, higher runtime. (Default: 10000)
--tomtom_treshold float Threshold for similarity score. (Default: 0.01)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Datatypes all-caps or not -> only one format should be used

README.md Outdated
Start with installing all dependencies listed above (Nextflow, conda, MEME-Suite) and downloading all files from the [GitHub repository](https://github.molgen.mpg.de/loosolab/masterJLU2018).
It is required to set the [enviroment paths for meme-suite](http://meme-suite.org/doc/install.html?man_type=web#installingtar).
1. Start with installing all dependencies listed above (Nextflow, conda, MEME-Suite) and downloading all files from the [GitHub repository](https://github.molgen.mpg.de/loosolab/masterJLU2018).
2. It is required to set the [enviroment paths for meme-suite](http://meme-suite.org/doc/install.html?man_type=web#installingtar).
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

enviroNment not enviroment

README.md Outdated
this can be done with following commands:
```
export PATH=[meme-suite instalation path]/libexec/meme-[meme-suite version]:$PATH
export PATH=[meme-suite instalation path]/bin:$PATH
```

Every other dependency will be automatically installed by Nextflow using conda. For that a new conda enviroment will be created, which can be found in the from Nextflow created work directory after the first pipeline run.
It is **not** required to create and activate the enviroment from the yaml-file beforehand.
3. Every other dependency will be automatically installed using conda. For that a conda enviroment has to be created from the yaml-file given in this repository.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

enviroNment

README.md Outdated
Every other dependency will be automatically installed by Nextflow using conda. For that a new conda enviroment will be created, which can be found in the from Nextflow created work directory after the first pipeline run.
It is **not** required to create and activate the enviroment from the yaml-file beforehand.
3. Every other dependency will be automatically installed using conda. For that a conda enviroment has to be created from the yaml-file given in this repository.
It is required to create and activate the enviroment from the yaml-file beforehand.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

enviroNment

I think we needed the better explanation for the problem with MOODS
Copy link
Collaborator

@anastasiia anastasiia left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I added the better explanation for the issue with MOODS

Copy link
Collaborator

@anastasiia anastasiia left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

here are the changes for the known issue with moods

README.md Outdated Show resolved Hide resolved
Copy link
Collaborator

@SebastianBeyvers SebastianBeyvers left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks fine (for now :D)

@renewiegandt renewiegandt merged commit 9b849f2 into dev Jan 13, 2019
Sign in to join this conversation on GitHub.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

5 participants